Mining Causality Knowledge From Thai Textual Data

نویسندگان

  • Chaveevan Pechsiri
  • Asanee Kawtrakul
  • Rapepun Piriyakul
چکیده

Mining causality knowledge will induce knowledge of reasoning beneficial for our daily use in diagnosis. This framework is for discovering causality existing between causative antecedent and effective consequent discourse units. There are two main problems in causality extraction; causeeffect identification and cause-effect boundary determination. The causeeffect identification problem can be solved by learning verb pairs among different elementary discourse units and learning lexico syntactic pattern (i.e., NP1 V NP2) within a single elementary discourse unit from annotated corpus, by using the Naïve Bayes classifier. WordNet will be used in this learning for providing the concept for the verb pairs and NP pair of the lexico syntactic pattern after translation of Thai words to English words by using the ThaiEnglish dictionary. The cause-effect boundary determination problem can be solved by using centering theory and cue phrase or causality link, where the cue phrase would include the discourse markers and verb phrases. Our model of causality extraction shows precision and recall, of 86% and 70% respectively. Our evaluation is based on the expert’s results.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Application of Kansei engineering and data mining in the Thai ceramic manufacturing

Ceramic is one of the highly competitive products in Thailand. Many Thai ceramic companies are attempting to know the customer needs and perceptions for making favorite products. To know customer needs is the target of designers and to develop a product that must satisfy customers. This research is applied Kansei Engineering (KE) and Data Mining (DM) into the customer driven product design proc...

متن کامل

Learning to Predict from Textual Data

Given a current news event, we tackle the problem of generating plausible predictions of future events it might cause. We present a new methodology for modeling and predicting such future news events using machine learning and data mining techniques. Our Pundit algorithm generalizes examples of causality pairs to infer a causality predictor. To obtain precisely labeled causality examples, we mi...

متن کامل

ارائه مدلی برای استخراج اطلاعات از مستندات متنی، مبتنی بر متن‌کاوی در حوزه یادگیری الکترونیکی

As computer networks become the backbones of science and economy, enormous quantities documents become available. So, for extracting useful information from textual data, text mining techniques have been used. Text Mining has become an important research area that discoveries unknown information, facts or new hypotheses by automatically extracting information from different written documents. T...

متن کامل

Learning Causality from Textual Data

We present a new methodology for modeling and predicting future events through machine learning and data mining techniques from textual data. Modeled events span across varied domains including politics, economy and society. The model employs human-style prediction techniques such as causality inference, generalization and projection based on past experience. For this purpose, we use news archi...

متن کامل

Text Mining -knowledge Extraction from Unstructured Textual Data

In the general context of Knowledge Discovery, speciic techniques , called Text Mining techniques, are necessary to extract information from unstructured textual data. The extracted information can then be used for the classiication of the content of large textual bases. In this paper, we present two examples of information that can be automatically extracted from text collections: probabilisti...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005